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Method for Purifying Proteins and or Biomoiecule or Protein Complexes 

Description 

^ne present invention relates to a method for punf^'ing suostances sucn as 
ciomofecu:es, orote'^s, protein and. or biomoiecule comoiexes, subunits c' 
biomoleouie con^p^exes. cell components, ce'l organelles or even wnoie 
cells. It also concerns fusion proteins for use in this n-:etnod and other 
t C ^eiated subjects. 

^rotein expression and purification methods are essentia': for stuoying ine 
structure, activities, interactions witn other oroteins, nucleic acids etc. o* 
orotems of interest. Methods that are currently available use system.s sucn 
15 as bacteria or cells transfected with expression vectors or infected witn 
oacculovirus. 

In orcer to study individual proteins a first recu:rement ;s to cota:n su*T;cie^: 
am.ounts of that particular orotein to oe able to carr/ out biolcgicai anc 

20 biochemical analyses such as activity tests, interaction assays, structu-c 
determination and the like. For th:s purpose the genes coding fo^ the 
proteins of interest are cloned into vectors tnat ai^ow the expression c' 
tnose proteins in suitaoie nost cells. Usually tne or-oteins are exc;^essea a: 
high leveis. This ove^-expressicn leads to the generation o' large amounts 

25 of protein cut often has tne cisacvantage of vieidmg msoiubie protein v;hich 
is present in so-cailec nciusion bodies tn the ce'is. "^he o ver-exp-essec^ 
protein tnen nas to oe reso lubilized oefcre analysis. Aithougn such metnods 
■AorK vy^'' cc^\'e^:o^a: cm:e detect^c -^etnocs based on v.e.g^: 
analysis iccivacryiam.ice ge's. Vvestem. blots, etc.^ of t-e excresseo protei- 

3C tnev are no: su:tao:e ctne^ stuc es arc *cr assays on o^ote:n oomoiexes. 



C'J^e"*:-.- used prctccois f c c . er-e x p ress.r-, g cote. "5 are nz:^a\\: za^nez 
,., . ,^ ~.cs* cells in v.r.,zn tre prc.ein of interest ;s -~s-t ncrn-ia.lv expressed 
fo- exe — c'e v.'hen s^kai-yctic crcteins are expresses: in Pastena. Aca't frc"- 
tne insciubility, proteins excressed m tn;s n^.anne- often show a lack c" 
5 proper post-transcr:pt:onai cr post-transiationa. n^.oa.Tioation, sucn as ccect 
processing cr glycosylation . 

Another disadvantage is that the exoress:o- of one subunit of a ccmoiex 
even in a homologous syste~. nnight not lead to increased production of a^ 
10 the com.plex components and in some extrem.e cases can even result i-- 
mistargeting of that protein of interest to an aosrrant com.Die< (e.g. the 
proteasome, SwafielO et a'.., 1996, Nature 379. 558;. 

Purification of proteins expressed at the;^ basal level is therefore 
15 indispensable in many cases but appropriate purificat:on protocols are not 
easily available because of tne huge biochemical knowledge required to 
establish a suitable protocol. Developing a purification scheme requires 
assessing the chemical and cnysica! prooerties c' the target protein by a 
trial and error orocess, m.aking it tedious and tim.e-consummg because sue- 
20 analyses have to be repeated fc each protein. 

Affinity purification methoos for tne ou-^cat: = n c" proteins are known. Kits 
and appa-atuses for affinity cn^cmatography are comm.ercially available 
Usually, a fusio" protein of a poiypeotide of interest plus an affinity tag is 

25 exoressed m a system such as one c* tne above-m.entioned exoressio- 
system.s. the fusion protein is extracted and aooiied to support materia' 
such as a resir- matrix packed :n an affinity coiurr.n which is coatee with a 

miateria tnat soec: .ca... cin^_ . .. ■ ■ j 

orctem to tne mat'ix v,3 tne c^^mt; tag ^r-zz'^r^c suostances ca 

3C removea simply d washing tne mat- ... Tne fusion protein can t-c 
eiuteo of' t-e miat-> using cnem.cai ace-tc c soec-ic temperature : 
cn"c:fc-s S^co tne af'l-,:. t^gs us. .3: . n:g" a'-init . 5 



are neeae-a b„„bc^uc>.. .. -.^ - ^-^^ ^- - 

-2^5 possess grouos cr moieties which are capab'e of bir-cmg tc a 
scecific binding partner with high affinity. Various affin.ty tags are know- 
;n the art and have been wioeiy used for the purification of proteins, 
Examoles are the IgG bmd'-g donn.ains of protein A of Stapnyiococcus 
aureus, glutathione-S-transferase (GST), maltose binding orctein, ceiiuiose 
oinding domain, polyarg,n.ne, poiycysteine, Dolvhistidine , pciyphenyialanine 
calmodulin or calmodulin bina:ng domains. These bind with nigh affinity tc 
an aopropriate matrix which ,s covered with the specific binding partner. In 
tne case of protein A, IgG-coated sepharose has been used for affinit, 
chromatography of fusion proteins possessing a protein A domain (Senger 
etal., 1 998, EMBO J., Vol.17, 2195-2207). Other examples are discussed 
,n Sassenfeld, TIBTECH, 1990, p. 83. A plasmid vector containing a 
cassette encoding a calmodulin binding peptide is available from Stratagene, 

^,3,^3:,,^, fusion proteins are tagged with only one affinity tag and are 
purified in a single purification steo. This often leads to proolems due tc 
remain.ng contaminants. Another limitation of most of the convent.ona: 
methods IS tnat they are adapted for expression of the proteins in bacter:^ 
on'v. W095 40943 discloses a method of expressing fusion proteins 
gram-positive bacteria eit-c anchored to the membrane or ;n secreted 
Tne anchcrec orotems are cleaved off using TEV orotease and subset 



3 H i ^rr— 



tac. 



Often the affinity tag .s rem.ovec from the fusion protein after tne am 
ouri::cattor step cv tre act.on o:' a specific prctease such as the ' 



;tease. Tnis m.eans hc-.ve-.'e- tnat the pur.^e: 



substantial amounts of tnis protease : Senger et a.. 199S- 
,^,.3 -,^ni:--q-:n-s c- sue" prctem preparations. 
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it IS :re:--3'ore a- 
L j':fiC3t:or, detect.-: 
comolexes and c corp. 



cbiec* c- tP.e presert :.--.ention to prov-ce 3 
metr-od fc^ p-otems and c z:c-o\ezii:e cr p'Cte- 
ponents c subun:t5 tre^e^ .-.nicn eliminates tne 
the current!-, knov.n methods a-.a allows efficient 

5 as fL:s;on prote;"S O-t 



disadvantages 

purification not cni-/ affinity tagged target prot 
also of other substances that are capable of assoo:at -g -.vitn tnese p'oteins 



One methoc according to tne m-.-ention for Punh,^ng substances seiectec 
from proteins, biomclecules , complexes of proteins o^ d,o molecules , 
supun.ts tnereof, cellular components, cell organelles, and who^e cells 
comprises the following steps; 

(a; proviomg an expression environment containing one or more 
heterologous nucleic acds encoding one or r-cre polyDeptides anc or 
one or more subunits of a biomolecuie comoiex. tne polypeptides o^ 
subun.ts being fused to at least two different affinity tags, one of 
which consists of one or more IgG omding domains of 
Stapnyiococcus protein A, 

maintaining the expression en-.ironment .noe- conctions that 
facilitate expression of the one or more pcvceptides cr suPunits m 
a native form as fusion proteins with tne affinity tags, 
detecting and'or punfying the one or mc^e poivpeotides or suPunits 
b, a com.cmaticn o*' at ieast two differe-t a^-'mity purification steps 
each comprising binoing tne one or more po ceotices or suPunits v,a 
3..,^.,^, ,33 to a support materia; caoac e o' selectively binoing 
one C-: tne affmitv tags a-d separating tne one cr mce pclypect^des 
or supunits ^rcm the support matenai a^er s--cstances net bound to 
the support materia' have been removec. 



(b: 



!ci 



An alte^nat -. e --stncd or tne mve 
aetectmg and z' pu'ifvmg prctem cr pio 
comorismc f-^e steps: 



0 r', n 1 : 



; p -:'t:cu:3r: . SU:taE-:- 
1 e s c m p . e X e s is a met"": 
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3 providing an exc^ess'on erviro--e-: ccr,t3ip,:ng one z' -c-e 
heteroicgc js nucie : acics encocrnc 2t ;e3s: t.vc sjcj" :3 
3 cicmolecule con-D!ex eacn cemg "jsed to a: least one c 
different affinitv tags, one of which consists of one or n-ore 
igG binding donnains Staphylococcus orotem A 
• b-' maintaining the expression envircnnrient under conditions tnat 
facilitate expression c^ tne one or n-ore subunits in a native 
form as fusion proteins with tne a^fmitv tags, ano unoer 
cor- tions that allow the format'on o^ a complex between the 
one or m.ore subunits and possibiy otner com.ponents caoabie 
of complexing with the one or more subunits, 
' c detecting and, or purifying the com.plex ov a combinatic" of at 
least two different affinity purificaticn steps each comp-smg 
binding the one or m.ore subunits via one arinitv tag to a 
support material capable of selectively binding cne of the 
affinity tags and separating the complex from the supoort 
material after substances not bound to the support materia! 
have been removed. 

For tne pu'pose of this invention, a biomolecule can ce a protein, pectice 
nucleic acid or ether bicmoiecule. A biomolecule complex aenotes a 
•• at least two biomclecules . preferably at least one protein 
associatea eitner witn other protems which are then cailea subunits or witi-. 
otne^ substances whic" can for example be nucleic acids. The biomolecule 
cc"o^-2xes can be nature! ones sucn as nuclea^ snRNPs or antigen-antioccv 
comoiexes, o^ they can be artincia! ones such as mutant DNA binding 
O'oteins associated with mutant target :DNAs. Anv complex molecule 

aczolZ.r^2 tne :nve-t:o- and - :-r:ner comcr:s-3 otne^ co-ocne-- 
v.hic^ associate m a manner stanie e-ougn not :c ::e Gisscc:a:ec t^^ 
afnnm, 5:ens \s a ci on-oiocu!- ccn^n^eA .^.a. ^a.. 
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The nucleic acic secuence of tne prcteir tc oe cu' --.ec rrust be kno-.vr 
at leas: available sc that :: can oe c!oneC mtc a n-cleic acid which is 
suitable to drive expression in the appropriate nost ceils or cell-free 
exp.'-essicp systems, !f a p-ctem complex is tc be c-rified, the nucleic acid 
seauence o^ at least ore of its subunits has tc ce Kr^own or available. 

The heterologous nucie,- acia driving the expression of tne protein tc be 
purified according to the invention thus contains apcropnate sequences that 
allow It to be maintained in the chosen host ceil c ce:i-free system, such 
as 3 promoter and, if necessary, ether control sequences such as enhancers 
and poly A sites. 

in principle any host cell that is compatible witn tne heteroiogous nucleic 
acid from which the polypeptides or subunits are tc be expressed is suitable 
as an expression environment. These ceiis can oe oroKaryotic celis such as 
bacteria etc. cr eukaryotic cells such as yeast, fungi c mammalian cells. 
Preferably, the prctem or subunit or protein ccm.plex to be purified is 
expressed in its natural host. Since th's metncc is ve-/ efficient, the 
proteins are oreferably exo^essed at the;r basa' levels. This has the 
advantage of avoiding tne formation ircius:c-: codies and also reduces 
the risk cf toxic effects on the cell that large amounts cf certain proteins 
may have Furtnermcre, th:3 avoids our^fving excess orctem subunits that 
are net assembled into a complex c' tnat a- assembiec into aberrant 
com.piexes (see above''. 



After the neter-doccus n 
introduced into a cncse- 



As airea:2v' rr^eniioned, 'r.e transcriptional control seci^^ences are preierac . 
selected so that the fus:cr. prote.n :s not cver-exp-essed b-Jt is exo'eoseo 
at basal ieve:s in the ce.:. Tnis serves to ens-j-o that tne protein is excosseo 
m a native forrTi. Native form, means m tn:s context that a correct or 
relativeiv close to natura' three-c:mens,on a. str-.cture of the protem 
achieved, i.e. the protem is folded correct;.. More preferably, the protein 
will also 05 orocessed correctly and sho'.v normal oost-transcriotional and 
post-transiationai modification. The correct foidmg is o* great importance 
especia'^y v.hen the expressed polypeptide is a subunit of a protein complex 
because it will bind to the otner subunits of the complex only when it :s 
present in its native form. However, it is also possible to express mutant 
proteins. Tnese can also have a native conform.ation. Such mutant subunits 
can, for example, be used to purify mutant complexes, i.e. complexes tnat 
contain some other mutated subunits. 

Depending on the prote.n or subunit to be purified, the fusion protem is 
expressed intracellularly or secreted into the culture medium. Alternativeiv . 
It might be targeted to other cell compartments such as tne memorane. 
Depending on the protein an appropriate method is used to extract tne 
fusion protein from the cells and, or medium. When a fusion protein ;s 
expressed which is targetted to a certain subceiiular location, e.g. the 
memorane of cell organelles or the cell mem.brane. these organelles c tne 
cells tnemseives can be Dunfieo via tne ending o' these membrane 
proteins. It is also possible to puri-y cells or cell organelles via proteins 
naturallv exoressed on tnei' surface wmch D.nd to tne fusion protem cf t- 
invention. 



or otne- s-ostances tnat are capao^e c-o ng to or compiexing wi.n .. . 
fusion prote n generated according to tne invention, 'hese substances ca- 
binc to rus on r:roto,n either direct:v or :a ^m-.e^ meciatcrs. Linger mediator 



Ci-^r-^ciecu'es sc tnat trese Cicmoiecuies are r^en par: a corr>c:e\ 
, p^3w i:-? erectly 35scc;3ted w!t^ eaa^ c:"e^ 

Acccrc'-a tc the invention :t >s aiso possible to use :e''-free svsten-,s tc tne 
exo'essio- c' the poivpeptioes c sjbunits. ""hese ^.ust o'o-.ide ai: tne 
corrcenents necessary to effect expression of prote ns f^c" t"e nucleic acio 
s^'cn 3S transcription facte rs , enzvnnes . ribcsomes etc . m vit'c transcriptic- 
ano translation systems are commercially available as kits so that it is net 
necessa-v to describe these systems in detail (e.g. rabbit reticulocyte lysate 
system.s for translation!. A cell-'ree or m vitro system, snouid aiso aliovv tne 
form^ation of complexes. 

For tne purification according to the invention it is prefe-aoie to empiov 
affinity cnromatography on affinity coium,ns v-'hich contain a matrix coatee 
v.'ith tne appropriate binding partne- for the af*::nity tag used in that 
particular purification step. 

1" acccrcance witn the method o' this invention tv. c aff -^.f/ steps are 
ca-riea out. Bastcaliv eacn affinity step consists c^ a Dind:nc step in whicn 
the previously extracted orotem is bound via one of its afr'inity tags to 2 
supoort m.aterial which is covered with the approcriate bincmg partner fc^ 
tnat a-hmtv tag. Then unoound substances are rem.oved and finally tne 
protein to oe purified is -covered 'rom the suppo't m.atenai. This can be 
done IP, two ways. The firs: possibility is to simoiv use conventional eluticn 
tecnnicues such as varying tne c- or the salt or bu^fe^ concentrations and 
tne like depending on tne tag usee. Tne second possiciiity .s to release tne 
p^otem to oe purified from tne support m.atenal by croteoiyticaMy cieavmg 
c- tne af: -ity tag cc-c to tne support. Tnis wav tne crctem can ce 
reoo.ereG r tne 'z— z: z tru-catec ^usior. p-te^" Z' ' z a^^n-'. tag- 
nave been z ea.eo z'' as tr- target pcoeotice c sucu- t itself. 




^iore pre'eraoiy, a specific prctecivtic cleavage site .5 c^esent ;r; tne :u5,c" 
crotei-^ befAeen the cne 0^ rr:Ore poivceptides or s^cjnits and tne one 
more aff:nitv tags so that proteolytic cleavage aliows tne ren-.ovai of at leas: 
one cf tne affinity tags, especial'y the IgG binding c::^,a;ns of protem A, 



^oteoly: 



cieavaae can be earned cjt bv c'^.ennical n-.eans o^ enzvnnatic; 



The proteolovtic cleavage site that is used to cleave ct one o^ the a::\n\i\' 
tags IS preferaoiy an enzymatic cieavage site. Tnere are several proteases 
v^vhicn are higniy specific for short ammo acic seq^-ences which tney wii: 
cleave. One of tnese is a soecific cleavage site c' Tooacco Etch V;rus 
{TEV^, whic- is cleaved by the TEV protease NIA. Recon-.binant TEV 
protease :s available from Gioco BRL. Tne TEV cleavage site is pre'erao:v 
used as tne cleavage site to remove the o^c: 
protein . 



A Gcmiams from the fus: 



E e m o ^ e 0 " e * e r a b i \v the a f h ni t \' s t e d 'J s 1 n g c r o t e : A b 1 n a ; n g t ; 
earned out ''^st by binaing the one or mo^e ooivDeptiCes or subunits v;a tr^e 
one or m.o^e loG omdtng oomams of Stapnviococcus :o a support m.ateriai 



:apaDie cf snecific 



V cinamg the latten rem.ovmg s'^cstances not ecu 
to the suocort materia! anc separating t-^.e one cr m.cre pc;ypect:ce3 
subunits from tne supoc^t materia! oy cieavmg tne igG cincmg ccmc 
Via tne soec.hc p^otecT/t;^ c!ea.due 



^e^ a^^ 



usee to p'. 
binding t: 
SU C OC ^t : 



; n ■ 



^ t [ p ^ 



ie pciypeot.ae c suoumt via a^ctner a: 
r - * ^r--^' -^arable c * s o e c ; f ; c a I ' . 0 1 n c ; n g 



iatte^ re' 



^ , „ „ - ' • " - - • J ■ - a - s e c 3 ' 3 1 
■ s = net D 0 u n c - - - ■ ^ - ■ - • ■ 



\',r-;en tne proteins arc prese,., a. ^ji. 

er.ironment and on tne support matenai, a large amount of protease ;s 
reau.red to release the bound mater;al from, tne supoort matenai. cthe' 
words, v.nen the substrate concentration is low a high level enzvme ,s 
reouired to drive an efficient proteolytio reaction. Tne second purification 
ster is then important to remove remaining contaminants and the protease. 
Removal tne protease is preferacle in order to eliminate any negative 
influences of the proteolytic activity on the oreparaticn. 



However, in some cases it may be cesirable to remove ali the affinity tags 
,n which case ,t is also possioie to utilise two o.^ more different D'oteolytic 
cleavage sites for the separation of the polypeptide subumt of interest from 
the support material. 

r.^^ low^nr-n- HGt cnlv facilitates efficient 

ourification of proteins of interest but aiso allows fishing for and detecting 
components present ,n complexes with which the polypeptides or subunits 
are associated or complexed either directiy or indirectly , e.g. molecules such 

.. , -r^,- :.|i->v Qp'pctive fisnina for certain 

as itRKer meaiators. ihi:^ .^^^\^ an^w . 

u <^ ^-.w r^*3-T-:^i .^TL-s even fron^i ccmpiex nnixtures. 
substances whicn may ne co^e.-iiai o.^-^ c^^.- 

According to a second ennc cd -^e'^:: z 

d-pct or purify the s_bun,t containing fusion Proteins expressed but aiso 
other substances that are capaoie o' associating wit-^ tne proteins 

^ ^.r — 'w Kipr-mc to tne fusio^ prote;'^, c 
expressed m a direc. , i.e. ^ ^ ^ - ^ - 

mdsrectiv via otMer mo.ecuies .'^^ ^..mji..^- - 

. 1. r^r.^,*^-r ccmolex is DurineG 

prctem c' a subun;! c* a o;o. ^ ^ — 

^* t-o c*^-3 are chose-^ gc tnat c:ne^ 

accoromg to tne invention t.,e ■ 

con^n>ex ccn^pcne'^tc xv" "Xj . - ... ... ■ - 



e invent;cn ;t is ocssible net oni 



assoc. ated v;;:n tne s^c^ ^ ^ ^^-^ ' ^^--^ 

Tne o, = ^c:ec..;e co-.ciexes can be forn-.eo tne excession env,ror,~..en: 
such as ceil'jiar ccrr,o,exes. Ai:erp.ative: , , ctr.er zz-z:ex ccmpcnen:3 ~a; 
be added to the sun'jnits aireadv expressed to for:", compiexes in vitro or 
rray even ce adasc v/ne- tr^.e sub'jn:t containing fusion protein is alreaov 
bound to a support materia! in an affinity step. 

It iS also possib.e to express t.vo icr nncre> subun:t3 c^ the sa.nne connpiex 
each as a fusion protein with a different affinity tag. When the subunits 
associate tney can be detected, purif.ed pcssip: . tcgetne^ with othe^ 
ccnnpie< components cy a series of affin.tv steps in whicn each tirrie the 
complex IS pound via a clfferently tagged supunit. "ne two or more affinity 
tags can be fused with a Single subun.t cf a complex or with two or more 

■ ■ ^ I, ^ orh«='- a'-» simplv present in the same 

subunits wnich bind to eav-h o^ne. o. d. - ^ 

complex. 

The purification steps ca- be earned out as oescripec above. 

Some polypeptides are present in mors tnan one complex so tnat the 
components of ail com.piexes can be pu-;:-eC. 

,f one IS interested n a smgie oomoiex ,^ one ca^, aiso subtract othe^ 
complexes B that also contain one cf tne suPunits cf A pv fusing tnat 
subun-. o^ A to one tag ana fusing a subun.t umcue to 2 with a different 
tag Tne tacgec supun:t c^ 3 wiii end to a specific support material. If tne 
.action net pc.nc tc tn.: suppct matenal ^s used - t- second a^^nitv 
purification step c = ~c:ex E .-.11 no ic"ge' pe present cecause was 
removed .subtraoteC: m tne first aff.nitv stec. Manv s:-^: =' scenarios can 



be envisacec cnc; cesig-^e 



- . a perccn skinec :n the ar. 



3d2.t;cn 
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ng C3nr">ains tnat can be 
\ ccn/entiona: a' 



laa. i^-e^erao'v . tne second af^initv tag aansists a: leas: one ca:"':: 
c:ndina -eotice ;CBP'. Cain^oc:a';n D:nd:ng ceptice as an affinity tag 
Deen Gesnr:bed and is cc n-.n^erciaiiy avaiiaoie , Stratagene , . V\ne 
caimcaj' binding peptide is used the cor-esponcmg purification st. 
camec aut using a support matena! that is coatea with caimodui.n. 
ca::-^oau!:n binding peptide tag binds to calmodulin in the presence of 
concent-ations of calcium. It can suosequently be eluted using a one- 
agent s-^ch as a onelating agent iike EGTA. -re'erao y, arouno 2 - 
:s added for tne eiution step. 



Another aspect of the present invention is a fusion orotem consisting c: one 
or more ooiypeptices or subunits fused :o at ;east two affinity tags, wnerem 
one o^ tne affinity tags consists c" at ieast one tgG binding domain of 
Staphylococcus protein A. 
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Otner ^us:on o^otems acoorcmg to tne invention are tnose aGa:tiona, y 
includinq a oroteo lytic cleavage site, oreferaoiy to cieave o^f tne IgG oinamg 
domains, c^ those in which the second tag represents one or more C3Ps. 
Aqa:n, tne skilled person will be able to select and construct the most 

^^:S o' taas and c eavage sites and polypeptides and o^ 



' ^ b ' e CO m o ! n a 1 1 o ^ 



suounits m fusion proteins aep^ 



o-- tne afrinity strategy usee. 



fusion protein can oe co^.s 



:ructec so tnat the above-mentioned pun:: 



d e t e 0 : : o : 



■ r- r 1 C ^, ; 



■^c prcceoures oa' 



-ed out. 



i here are several possionities : 
0 r ' n 0 'i O' ' e : n e a f : i ^ 1 1 t a o s a 

r rr-.ir. 3[ e n 0 s c *" tne c- o 1 o e o t : a e s 
-n whion :he tags are *usec wit^ 
cnticai out can oe cnos^-^ accc' 



" 0 o n s t " u c 1 1 n o the fusion p r c t e ^. . 
• - ^ ; c s e t a e i t h e " t ^. e N - o ' 

5'^o^r^>t:s^ to oe exp^essec. ^ne c 
:ne oo:\ oeotioe: s : or suDunit^s- g 
^ * ^ : e a f ' : n ; : y p ^ c^ t o a c ' t o be ^ ; 



1 5 



25 



- 1 3 - 

ip,:erest internally (as long as the reading frame cn tne nucleic acta is no* 
crjnced:, =^-eferablv tre tags are iocatea near ts the same end tre 
pcivpept:de!s; or subunitis^, -.vherem it is esoeciai' . crefe-red tnat tre :g2 
binding domains are placed at the N- or C-term:nus of the complete f-jsic" 
protein, followed by a proteoivtic cleavage site, t-e otne- tag;s;. and the 
polypeptide(s) or subunit(s). 

The fusion protein can also contain a second proteolytic cleavage site for 
the removal of the second affinity tag. The m.ost preferable combination of 
affinity tags and cleavage sites is the one witn orotem A domains of 
Staphylococcus as the first affinity tag whicn can be cleaved off via tne 
TcV protease and using at least one calmodulin oindmg peptide as tne 
second affinity tag which allows the elution of the truncated fusion protein 
using a chelating agent such as 2 mM cGTA. 

Another aspect of the present invention is a heterologous nucleic acid 
coding for a fusion protein as the one described aocve. 

A further aspect of the invention is a vectc como'ismg at least one 
heterologous nucleic acid coding for a fusion protein of tne invention. This 
vector contains the nucleic acid under the control of sequences v,/hicn 
facilitate the expression of the fusion protein m a particular host cell o' ce:'- 
free system,. The control sequences comprise sequences such as promoters 
and. if necessary enhancers, Doly A sites etc. The prom,oter and other 
control sequences are selected so that tne fusion protem is preferabiv 
expressed at a basal level so that it is produced in soluble form and not as 
insoluble material. Preferaoly, tne fusion protem ;s also expressed m such 

^ - . , ^ ^ . , ^ * i^'^]r^.^ r n o n r 3 * e ; p to be in a native 

a 'vv a V' as to a 1 1 c ^ w - , e ^ . . ^ . ^ . , , . - . — ^ ^ 

con^orrr^ation. Pre^erab;.', one or rrcre seiectaoie -arke^s are aiso o^esen: 
cn tne vector for tne n^amtenance - prokarvctic euKa^votio ceils, 5a5:: 
cloning vectors are aesonce^ - Sar-.'zrooK et aL. MoieciJiar Comnc. 
Moiecula^ Clonm::. - ^ac::rj--:r, ^'3'^ca^ ^o,r ^o-'nc --a.^.." _3--. 



Press, n 939'. Exarr.ples of vectors are Cias-=;'oo o^, ■ • - 

o 3 r 5 3 n d t h S h k 6 . 

in a preferred embodir^ent vectors are c or^.str^ote o containing pre-r",aoe 
5 cassettes of affin,ty tag combinations into which tne nuo!eic acid coding fcr 
the polyoeptide or subunit of interest can be inserted by means of a rr.'.itiple 
cloning site such a polynucleotide linker. Thus, a vector according to tne 
mvention .s also one which does not contam the coding sequences for tne 
po'yoeptideis) or subun,t(s) of interest but contai-s the above-mentioned 
1 0 components plus one or more polynucleotide linkers with preferably unique 
restriction sites >n such a way that the msert.on of nucleic acid sequences 
according to conventional cloning methods into one of the sites in the 
polynucleotide linker leads to a vector encoding a fusion protein of tne 
invention. 

,n a further preferred embodiment the vector comprises heterologous 
nucleic acid sequences in form of two or more cassenes each com.prising 
_„e „f different affinity tags one consisting of one or more Igo 
'binding aomains of Staphylococcus aureus orotein A, and at least one 
20 polynucleotide linker for the insertion of further nucleic acids. Such a vector 
can be used to express two suounits of a protein complex, each taggec 
With a different tag. 



Victors accorc.na to the invention can oe introduced into nost ceiis stablv 
or trans.en.v, they can De presen: ex:rac-ron.cscn.alSy or integrated into 
the nost genon^e, and they can be used to produce recomo.nant ce::s o: 
organisms sucn as transgenic anin-.ais 



Anotner oo^ect c^ tne :nve-tio^ is ^ ^t;- i ^ 

^^t,-.- 'rpc-A rp>\'s ^^B^' be prokar'y-c 
3C\c or a e . - c . ^ i > - — ^ . , l , ^ 

eukarvctic ceils, e.g. nactenai cens. ..3^^ ^^-■^^ - ^ 
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v * r a n s ; e n L ; \- c\ c r e n t : c r-^ 2 ; 



:e:'s 3:2:: v 



:ou^:i :n S^^^b^cc^ et a. 



Ye: a f.rtner aspect of tre nve"::on :s 3 rsacer.: k,: preferao^y c = mp^s:n5 
5 vectors as described above tcgetner w,th support n-,ater,als for carrv:^.g 

the affin;:Y steps. The support materials carry mo,et:es which are capa.ie 
ofspeo,fioallybind,ngtheaff:n,tvtags.forexample,ca,mcdunn-coatec resin 

m the case of caln.odul.r^ bir^amg oeptide as the af^in:ty tag or IgG-coated 
r.s.ns for affinity tags consisting of protein A dorr^ains, Aodit.onally , such 

10 a kit may comprise buffers and other conventional r^atenals for protein 
purification, especially for affinity chromatograohy . Further, the kit 
preferably provides at least one proteolytic agent such as a chemical agent 
capable of performing proteolysis or a protease and.or cnemicai agents such 
as chelating agents, wherein the protease ,s capable of proteolyticaily 

1 5 cleaving the fusion protein. When two proteolytic cleavage sites are usee 
the kit will preferably contain two proteases. 

I c;^.,r = c co'wp illustrate the invention and its 

Tne following Examples anc Figures se. .= ".ub. 

■• -hpv aro no-v^v-r not Intend ed tc limit the scope cf 

practical application, tney ar-, n^.j^j^>. 

20 the invention. 



snows a Cccmassie stai-ed gel decicting tne fract.onatec 
proteins of a yeast RN A-prote,n complex. Proteins loentifiec b . 
r^nass scectrometr , are laceied 1-24. Bands 1 , 2, S- 1 M 5-2- 
v.ere expected m this comoiex. Bands 2. 4-7 represent 
proteins that are I^Keiv candicates for true complex componen* 
given the:^ secuence. Bancs 12-14 represent potentia 
co-tam.nants nccsc-a. c'Otems. Banc " 5 :s a trace amcu- 
of TE\' orotease t-a: rem3:nec :n tms carticuia^ c^eca'at c- 
Tnis IS ret genera.v tne case ^see Fig. 2 for examnie:. Banc 
4-6 originate -ci- tne same gene an^ m:g- -ecreser^ 
a.ter-a::ve :ra-^a- - C'ca.cts cr ceg-aca-c" cr:cua:r Ba- 



- 1 D - 

16 :s a msxTure ana contain; 
P ^ --^ n :2 ^ c t e ; n , a c o n : a m i n 3 1 



a:::::::. on to a be 
;ooso:^,al protein. 



10 



shov.s a Coon-:assie sta;ned ge: aepior.ng tne fractionateo 
proteins n-,oiety of the yeast U1 snRN?. Tne 10 specif:c 
proteins were identified Dy nnass soectron^ietry. Compare vvitn 
the silver stained gei obtained following a purification using a 
cao affinity step and a Ni-NTA coiumn reported by Neubauer 
et a!., (Proc.Nati.Acad.Sci USA 1 997. 9^, 335-390;. Note in 
particular tne low level of contaminants in th;s purification. 
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Fig. 3 



: >: 3 o I e s 



shows 3 Coonnassie stained gei in whicn purified Ul snRN- 
has been analysed using either the CBP Oinding.'EGTA elution 
alone (lane 1 the Protein A binding TEV elution alone (lane 3. 
or both steps (lane 7). Arrows on the right point to the Ul 
snRNP specific proteins. Lanes 2, 4 and 8 show the 
background signal obtained using each of the single steps or 
the two ster^ procedure fronn an extract without taggeci 
protein, demonstrating the requirement for two steps to get a 
pure material. Lane 6 is a molecular weight marker. Lane 7 is 
the TEV protease that can also be seen as an abundant 
cc-ta^ma^t feven though oniy the required amount 
protease was used) in lanes 3 and 4. Tnis dem.onstrates again 
The need for a second purification step. 



'^0 



r^ur: 



ccmO'exes from veas* 



A vector encocmg a fusic^ c' a veas: orc:e'^ to z^e C 3"- i E V- i^'r : te:^ A 
c o 'J b ! 0^ t a 0 /; as constructed lj s ; c s t a a a r d e t r c^ a s . T'^.e " j s : o a ^ >c : ! s 
one suGur^.t of a oroteir^ ccnapiex c^ '/east conta;ni^a 24 suDunits i-^i tota.. 
Tpe pias-o-io transformed r^to veast cells anc 3 2^ culture c' ce;;5 

5 expressing tne prote;n was nreoared. Proteins we^e e<tracted from the 
cultured cells using a French press. Tne complex was purified by binaing to 
IqG-linked oeacs, eluting by TEV protease cieavage binding of tne elutea 
material on calmodulin containing beads foiiowed bw elution with EGTA. A!' 
steO'S were earned out at 0-4^C. excepted for TtV cieavage. 

1 0 

The first affinity step (IgG step^ was performed as follows: 
200 aI o: IgG-Sepharose bead suspension (Pnarmacia :7-0S59-0i ' w^ere 
washed ;n an Econo column with 5 m.i of IP^ 1 50- IgG buffer ! 1 0 mT/ Tris-CI 
pH 8.0, 150 mM NaCi, 0.1 NP40;. 10 ml extract, corresponding 

1 5 approximateiy to 2 L of yeast culture, were adjustea to IPP 1 50-IgG buffer 
concentrations in Tris-Ci pH 8.0, NaCi and NP40. This e>tract solution was 
mixed with the 200 ;j\ of IgG-Sepharose beacs and rotated in the 
Econooolumn for 2 hours. Tne unoound fraction was ciscardea and oeads 
with bound m.atenal were wasned first witn 30 m.i IPP 1 50-igG butfe^ 

20 followed by 10 ml TEV cleavage buffer (10 mM Tris-Ci pH 3.0, 150 mM 
NaCI, 0.1^0 NP40, 0.5 mM EDTA, 1 mM DTTK 

The target crotein was cieaved ana 
washed Econocoiumn was fiiied 
25 TEV protease and "stated in a ^5 
recovered bv gravity fiovv'. 

The second a'^mitv stec fOa:^'.; 
follows: 

30 The orevicus eluate was m,:>.ea 
buffe" ■■'''0 m*.^ i'-mercactcet^.a^c 
1 m'/ r/ c - a c e: c t e . ^ m',"! im:c; 



"e'eased from the beads as follows, i ne 
-m. ^; rr.'. TEV cieavage ouffer anc 30 p'l 
Z incucati^ *cr 2 hours. The e:uate ^-v a 
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v.'as f'jrthe" aaaed t r zicck the ED i A ccn^,.^. c: 
J f f e ' ^ n ^ 3 i " as rctated for 1 hour in ar 
Econocoiumn contain, rg 200 a: cf Ca:~.oc.;;n ceads siurry (Stratagere 
214303) previcusiy wasnea with 5 rr.l !P= 1 50-Ca:~cdulin binding buffer. 

After washing with 30 rr,i of IPP 1 50-Cain-.odulin omding buffer, protein 
corT^piexes were eiu:e= with 5 success;ve additions of 200 ;j\ of IPP 150- 
Cair7iocui:n eiution buffer (10 rr.M ^-mercantoethano!, 10 mM Tns-Cl pH 
B.C. 150mMNaC!, 1 mr.1 r/.g-acetate, 1 r-iM inn.idazole, 2 mM EGTA, 0. 1 °o 
10 NP40). 

Sarr.ples were frozen ,n dry ice and stored at -30°C. Proteins were 
concentrated by TCA precipitation (A. Bensadoun and D . VVeinstein ( 1 975i . 
Anal. Biochem. 70, 241;. The oroteins were detected by poiyacrylamide ge- 
electrophoresis with subsequent staining of the ge! with Coomassie blue. 
The result of the protein purification, a gel of which is depicted in Fig.1 
demonstrates that the strategy employed is highly efficient. All the 
expected orotein subumts which number 24 in this case can be detected. 



20 Example 2 

The sam.e p-ocedu-e was used for two other protein or prctein-RNA 
complex.es frcr-, yeas: where aV expected protein subunits were detected 
using the method c*: tne invention. Those are the CBC (Cap Binding 

25 Complex) and the U1 snPNP. Tne pun-'ied U 1 snR.NP is depicted m Figures 
2 and 3 Tne CBC comoiex nas been shown to be stiil active and the purity 
was good in all cases. Tnis method is relatively cneao and not very time- 
consuming, since :t ca-^, De cone :n one oa.. Concerning the U1 snRN== I 
IS note .vcth / that Ne..caue' et ai. .^rcc. Nat;, Acad. Sc.. 'dS^ 1:^;^. , 9- 

30 3S5-390' earned cu: a cu^.^ication ct tne S3m,e ccm.olex (U1 snPr^F: 

. -^-^^ r-r.-'ains o' the ccmoiex cMnterest we'e 

extracted *ror^ ;c^-,----r-_. — .c^.... 
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- n ^li r]r^. 



V visioie bv silve' stainina and several contaminants 



